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We study remembering Turing machines, that is Turing machines with the 
capability to access freely the history of their computations, These devices can 
detect in one step via the oracle mechanism whether the storage tapes have exactly 
the same contents at the moment of inquiry as at some past moment in the com- 
putation. The s(n)-space-bounded remembering Turing machines are shown to be 
able to recognize exactly the languages in the time-complexity class determined by 
bounds exponential in s(7). This is proved for deterministic, non-deterministic, and 
alternating Turing machines. © 1990 Academic Press, Inc. 


1. INTRODUCTION 


We investigate the computational power of Turing machines augmented 
with the capability of accessing freely the history of their computations, 
without the necessity of storing any record of it. We call such Turing 
machines the remembering Turing machines (R-TMs, in short). Technically 
R-TMs resemble oracle TMs. They have a special query state distinguished, 
and if an R-TM enters this state then in the next step it enters one of two 
other special states, depending on whether or not, respectively, the symbols 
written on the tapes are exactly the same as those in a certain past 
configuration. 

The main result of this paper is a relationship between the time- and 
space-bounded R-TMs and the respective ordinary TMs. First we show 
that the time-bounded remembering and ordinary TMs are polynomially 
related. On the other hand, it turns out that the s(7)-space-bounded 
R-TMs recognize the languages in the time complexity classes determined 
by bounds c, for c> 1. This result holds, respectively, for deterministic, 
non-deterministic, and alternating remembering TMs (DR-TMs, NR-TMs, 
and AR-TMs, in short). We show that there are possible certain restric- 
tions on the NR-TMs and AR-TMs which preserve the exponential 
relationship between the space-bounded R-TMs and the time-bounded 
ordinary TMs. The NR-TMs can be restricted to have the “no”-state (that 
is one of the special states) as the only final rejecting state, and the 
AR-TMs can be restricted to be able to enter the query state at most once 
in every computation. 
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The alternating TMs with either absolute or conditional restrictions on 
looping were previously investigated by Robson (1985). One of the specific 
models he studied, the non-looping ATM, was defined so that computations 
were considered valid only if they did not contain repetitions of configura- 
tions. For the non-looping ATMs Robson (1985) proved a theorem 
analogous to the part of Theorem 3.1 concerning alternating remembering 
TMs (actually he proved a stronger result described as Proposition 3.7 in 
this paper). We generalize his result to the deterministic and non-deter- 
ministic TMs. The proof given in this paper covers all three kinds of TMs. 
It is simple and intuitive, which is mainly due to domino tilings being 
unsed instead of direct manipulations with Turing machines. A key distinc- 
tion in our approach to the very definition of a “history-dependent” mode 
of computations is the presence of the oracle mechnism. This is what 
actually enables us to carry over the generalization in the deterministic 
case. In the non-deterministic case Theorem 3.1 can be proved for non- 
looping TMs, as is shown in Section 5. 

The AR-TM has a natural interpretation as a computing device model- 
ling games in which the “value” of a position, for example, whether it is a 
winning position or not, depends on the previous positions entered in the 
course of the play (cf. Lichtenstein and Sipser, 1980; Robson, 1984). Such 
history-dependent games are formally neither perfect nor partial- 
information games. In this way the theory of the non-looping ATMs and 
the remembering ATMs is a contribution to the program initiated by 
Chandra, Kozen, and Stockmeyer (1981), who introduced alternation as a 
mode of computation modelling the perfect-information games. Then 
Peterson and Reif (1979) and Reif (1979) introduced other “alternations,” 
namely the multiple-person one (Peterson and Reif, 1979) and the private 
and blindfold ones (Reif, 1979) (see also Jones, 1978). The underlying 
games in these new alternations are of partial information, and in the 
multiple-person alternation there are moreover two teams of players rather 
then two single players (Peterson.and Reif, 1979). Recently Ladner and 
Norman (1985) studied the solitaire automata, that is such private- 
alternation TMs in which the game is started by player 0 (the player who 
aims at rejection of the input) and once player 1 enter the game, player 0 
acts deterministically. The main results of the papers cited above concerned 
mutual relationships between the complexity classes of the various alterna- 
tions and the deterministic and non-deterministic classes. 


2. DEFINITIONS 


In this section we define the R-TMs and recall certain facts concerning 
domino-tiling problems. 
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2.1. Remembering Turing Machines 


The R-TM M is a Turing machine M with three special states: the query 
state q;, the “yes”-state q,, and the “no”-state qy. From q, M enters either 
qy OF qy, depending on whether or not, respectively, there has been before 
a configuration in the computation identical to the present configuration 
except for the state and the head positions. We say that an R-/oop has been 
detected if M is in gy. The R-TMs are considered in three versions: the 
deterministic, non-deterministic, and alternating ones. The respective 
semantics of their computations are natural extensions of those of the 
original models (cf. Chandra et al. 1981; Hopcroft and Ullman, 1979). 
Alternation in the R-TMs can be conveniently interpreted as a game 
process in the following standard way (see Chandra et al., 1981; Chlebus, 
1986; Grddel, 1987; Ladner and Norman, 1985; Peterson and Reif, 1979; 
Reif, 1979; Robson, 1985). There are two players: the existential player (of 
preference) whose aim is to lead the TM to acceptance, and the universal 
player who aims at rejection of the input. When an AR-TM M enters an 
existential state, then the next move is decided by the existential player, 
and when M is in a universal state, the next move is chosen by the univer- 
sal player. The query state is neither existential nor universal. The existen- 
tial player wins the game if M enters an accepting state. We refer to this 
game as the M-game. Given an input x, M accepts x iff there is a winning 
strategy for the existential player in the M-game starting at the initial con- 
figuration with input x. The time of a winning strategy S is the least upper 
bound on the times of the plays against all the strategies of the universal 
player, when the existential player employs S. M is said to accept within 
time t(n) if, for each input x of length n, there is a winning strategy S such 
that the time of S on input x is not greater than t(n). The space bounds are 
defined analogously. The time- and space-bounded deterministic and non- 
deterministic R-TMs are defined in the standard way (see Hopcroft and 
Ullman, 1979). The complexity classes DR-TIME(t(n)), NR-TIME(t(n)), 
and AR-TIME(?(n)) are the classes of languages recognized by the 
DR-TMs, NR-TMs, and AR-TMs, respectively, in time t(n). The space 
complexity classes DR-SPACE(s(#)), NR-SPACE(s()), and AR- 
SPACE(s(n)) are defined analogously. We use the following notations: if X 
denotes one of the letters D, N, or A, then ¥R-TIME(EXP(f(n))) = 
Ue>o XR-TIME(c/); the class XR-SPACE(EXP(f(n))) is defined 
similarly. 


2.2. Bounded Domino-Tiling Problems 


The key parts of the proofs of the facts relating time- and space- 
bounded R-TMs and the ordinary TMs, which are described in the sequel, 
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use certain results concerning bounded domino-tiling problems. A domino 
type is a quadruple of colours. A unit square with edges coloured as 
specified by the type is called a domino tile of the given type. Domino tiles 
cannot be rotated or inverted. Suppose there is given a finite set T of 
domino types and a natural number m. A T-tiled m x m-square is an m x m- 
square grid covered by dominoes of types in T in such a way that each 
pair of adjacent tiles has the same colour along their common boundary, 
and the external boundaries are coloured some fixed colour, say, white (cf. 
Chlebus, 1986, 1987; Gradel, 1987; Harel, 1983; Lewis and Papadimitriou, 
1981; Savelsberg and van Emde Boas, 1984). There is a close relationship 
between tilings and Turing-machine computations. An attempt to formalize 
this relationship was undertaken in (Chlebus, 1985). Sometimes it is more 
convenient to deal with domino tilings rather than directly with Turing 
machines, and the proofs of Lemma 3.5, Proposition 3.6, and Proposi- 
tion 3.7 may serve as examples. Below we describe in detail the relevant 
facts concerning alternating TMs and domino-tiling games; similar facts 
relating the deterministic and non-deterministic TMs and the existential 
versions of domino-tiling problems follow similarly from the results 
described, for example, by Savelsberg and van Emde Boas (1984) (see also 
Chlebus, 1985, 1987; Harel, 1983; Lewis and Papadimitriou, 1981). 

Consider the following tiling game, for a given set T of domino types and 
a natural number m. There are two players: CONSTRUCTOR and 
SABOTEUR. They cover an m x m-square grid with dominoes of types in 
T in the course of a game. The moves are performed alternately, one tile 
is placed at a time, and CONSTRUCTOR starts the game. First the bot- 
tom row is filled from left to right, then the second one in the same order, 
and so on. A move is legal provided the selected tile has matching colours 
along their common boundaries with the tiles adjacent to the left (if any) 
and to below (if any). If after completing this process a 7-tiled m xm- 
square is obtained then CONSTRUCTOR wins. In the other case, that is 
if either a deadlock occurs and no legal move could have been performed 
or the external side of a tile placed on the border part of the grid was not 
white, then SABOTEUR wins. It was shown by Chlebus (1986) that for 
each alternating single-tape TM M there are sets of domino types such that 
the accepting computations of M correspond to the bounded tilings 
obtained during the tiling game. More precisely, there is a finite set T,, of 
domino types such that for each word x over the input alphabet of M there 
is a set T, of domino types with the following properties: 


(a) M accepts x in time ¢(|x|) iff CONSTRUCTOR has a forced win 
in the 2t(|x|) x 2z(|x|)-square Tyo T,-tiling game; 


(b) The bottom row of any Ty, T,-tiled k x k-square, for k > |x|, is 
unique and all the tiles occurring in it are from T, (they encode the initial 
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configuration of M on x by the colours on their top boundaries; the tiles 
from T, are used only in the bottom row). 


Other variants of domino-tiling games were studied by Grddel (1987, 
1988). 


3. TIME- AND SPACE-BOUNDED REMEMBERING TURING MACHINES 


The relationships between time- and space-bounded R-TMs and their 
non-remembering counterparts are summed up in the following theorem 
(cf. Robson (1985) in the case of alternation). 


3.1. THEOREM. Let X denote one of the letters D, N, or A. 


(a) XR-TIME(EXP(t(n))) = XTIME(EXP(t(n))), provided t(n)2>n 
and t(n) is time constructible. 

(b) XR-SPACE(s(n1)) = XTIME(EXP(s(n))), provided s(n) >logn 
and s(n) is space constructible. 


Part (a) of this theorem for X equal to A can be strengthened to the 
equality AR-TIME(?(7)) = ATIME(t(n))—see Robson (1985) for a proof. 
The alternating time and space complexity classes can be expressed as 
deterministic ones as was showed by Chandra, Kozen, and Stockmeyer 
(1981). After converting Theorem 3.1, for X= 4A, into the relationships 
between the alternating-remembering and deterministic complexity classes, 
one obtains the same relationships as Reif (1979) proved for the blindfold 
alternation. 

The proof of Theorem 3.1 is divided into four parts corresponding to the 
respective four inclusions. 


3.2. LEMMA. XR-TIME(t(n)) € YTIME(t3(n)), provided t(n)2n and 
t(n) is time constructible. 


Proof. Consider an XR-TM M operating in time t(n). The X-TM M, 
simulates M as follows, M, stores the previous configurations of M in con- 
secutive blocks of length t(n) on its tapes. To perform an ordinary move of 
M, M, lays off t(n) new cells on the tapes, and then decides the move and 
stores the next configuration in the allotted space. To handle a move from 
the query state, M, scans the previous configurations. The simulation of 
both an ordinary and a query move takes O(1?(n)) of time. The overall 
time is this O(17(n)). f 


3.3. LEMMA. XTIME(t(n))S XR-TIME(t(n)), provided t(n) =n. 
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Proof. Observe that YTMs are YR-TMs which never enter their query 
states. J 


Part (a) of Theorem 3.1 follows from these two lemmas. 


3.4. LEMMA. XR-SPACE(s(n))G XYTIME(c™) for a certain c>1, 
provided s(n) = log n and s(n) is space constructible. 


Proof. Let M be an XR-TM operating in space s(n). The XTM M, 
simulating M uses three storage tapes. The first tape simulates the storage 
tape of M. The second one stores the consecutive contents of the first tape 
of M, in the previous configurations, without repetitions. The third tape of 
M, stores a counter in the space s(n) and with base d, where d> 1 is a cer- 
tain number to be specified later on. To simulate a single step of M from 
a given configuration, M, decides the next configuration and updates the 
tapes. If this is an ordinary move of a TM, then it is handled in an obvious 
manner. A move from the query state is performed by first scanning the 
“history of computation” which is stored on the second tape, and then 
entering either gy or g, accordingly. After having completed this M, 
checks if the second tape should be updated, that is whether the contents 
of the first tape are not stored anywhere on the second tape. If this is the 
case then M, lays off s(n) new cells on the second tape and writes the 
contents of the first tape in this space. The counter on the third tape is 
incremented by 1 after completing a simulation of every move on M unless 
M enters qy, in this case the counter is set to 1. The computation of M, 
starts with the input word being written on both the first and the second 
tape, and the counter set to 1. M, accepts if M enters an accepting state; 
M, rejects if either M rejects or the counter reaches its maximal value. 

Let b be such a constant that there are no more than b" configurations 
of M in space s(n). It may happen that M needs to go through a number 
of configurations in order to continue the computation by entering the 
“yes”-oracle state when needed. Again it may happen that in order to reach 
such configurations a sequence of previously reached configurations should 
be repeated in the course of a computation. Observe that such a part of the 
computation contributing a new configuration must have length at most 
band, since there are at most b" distinct configurations, the number 
of such contributing computations is bounded by the same number. There- 
fore the number b° may be taken as d. 

The simulation of a single step of M requires O(d*”’) steps of M,. 
Therefore M, completes its computation within d** steps. J 


3.5. LEMMA. XTIME(EXP(s(n))) S XR-SPACE(s(n)), provided s(n) > 
log n and s(n) is space constructible. 


Proof. Let M be an XTM operating in time c” for a c>1. The 
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required s(n)-space-bounded XR-TM M, equivalent to M simulates the 
process of constructing a suitable domino tiling. To describe M, we use the 
notations and properties concerning the bounded domino-tiling problems 
described in the previous section. Given and input x, denote by D, the set 
TyUTy of domino types, and by r, the number 2c"). M, has four 
storage tapes: fg, t1, f2, and ¢;. Tape to stores a single element of D,,. Tapes 
tı and t, each store a natural number not greater than r,. These numbers 
are equal to the column and row numbers, respectively, of the processed 
position on the tiling. More precisely, tapes r, and t, operate as c-ary coun- 
ters in space 2s(|x|). The operation of “moving” to neighbouring positions 
in the tiling is performed by incrementing or decrementing the counters 
accordingly. Typical contents of the tapes of M, can be interpreted as 
follows: the domino stored on fy lies in the column with the number writ- 
ten on z; and the row with the number written on t, of the tiling being 
“built” in the course of the computation. M, begins its computation by 
writing the sequence of all the domino types in Tą on tape t}. Positioning 
the head on such a type will be used as a means to remember it. (Note that 
Ty, is a fixed set independent of the input.) 

The tiling is constructed by selecting repeatedly first the tiles in the bot- 
tom row from left to right, then those in the second row in the same order, 
and so on. The selection of a consecutive domino is performed as follows. 
Suppose M, has just selected the tile, say, d, adjacent to the left of the pro- 
cessed vacant position, d} is stored on fg, and tapes r, and ż, contain the 
coordinates of d, in the tiling. M, begins by erasing tọ and remembering d, 
by moving the head on tape f, to the position of type d,. Then M, 
decrements the ¢,-counter by 1, thus “moving” to the row below, and next 
increments the ¢,-counter, thereby “moving” right to the next column. M, 
wants to get to know which domino tile has been placed at this position. 
To this end M, resorts to its power to remember the history. It repeatedly 
places consecutive dominoes on fy and asks the “oracle” whether the tapes 
looked exactly the same at some moment in the past. When an R-loop is 
detected eventually, then M, knows both the domino d, adjacent to the 
left, and the domino, say, d, adjacent to below the vacant position. M, 
selects a domino (if there is any) matching the right-side colour on d, and 
the top-side colour on d,, places it on fy, and increments the row counter 
by 1. After this step the next consecutive tile (adjacent to the right of the 
last position) is started to be selected in the same manner as that described 
above. If M is deterministic, then there is at most one tile fitting in the 
current vacant position; if M is non-deterministic, then M, chooses non- 
deterministically a proper domino; and finally if M is an alternating TM, 
then M, simulates the tiling game so that the moves of CONSTRUCTOR 
are being performed by the existential player, and the moves of 
SABOTEUR by the universal player. 
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It remains to describe the actions of M, when it is on the border part of 
the tiling, that is with one of the counters storing either 1 or r,. The bot- 
tom row of tiles is uniquely determined: the first dominoes encoding the 
input are obtained through scanning the input, and the remaining part is 
filled with “blanks” up to the moment when the column counter on t; 
reaches its maximal value. Then the row counter on f, is incremented by 
1 and the column counter on f, is set again to 1. This is always done when 
the right border is reached and the top one has not yet been reached. After 
a new domino on the border is selected then it is additionally verified to 
have the respective side (or sides) coloured white. If two counters are full 
and a proper domino completing a perfect tiling is selected then M, 
accepts. 

The storage of M, is dominated by the counters which require 2s(7) 
cells. This can be easily decreased to exactly s(n) cells (see Hopcroft and 
Ullman, 1979), thus providing the required space bound. J 


This completes the proof of Theorem 3.1. Next we give two variants of 
this theorem. Proposition 3.6 could be proved by a modification of the 
proof of Lemma 3.5. We give another proof which demonstrates particular 
capabilities of NR-TMs. This proof is next adapted to alternation to yield 
Proposition 3.7. 


3.6. PROPOSITION. Theorem 3.1 holds true for non-deterministic remem- 
bering TMs if they are restricted to have the “no”-state qy as the only 
rejecting final state. 


Proof. It is enough to prove Lemma 3.5 under this restriction. We need 
to show that there is a NR-TM M, constructing the domino tiling and 
having the “no”-state as the only rejecting state. M, is similar to the TM 
from the previous proof; the difference being that it does not have tape 1. 
In the first stage M, guesses a tilling row by row, enforcing only the 
adjacency constraints along the vertical edges of the tiles. Selecting a new 
tile is performed by incrementing the column counter by 1 and writing such 
a new domino on fy which may be adjacent to the right of the one before. 
That last condition is guaranteed by the next-move relation of M,. If the 
column counter reaches the maximal value then the row counter is 
incremented by 1, the column counter is set to 1, and a new row is started 
to be guessed. The boundary dominoes are also checked to have the respec- 
tive external sides coloured white. After the first stage is completed, that is 
after r, perfect rows have been constructed, M, performs the second stage. 
Now it tries to guess the same tiling in the column-by-column manner, 
enforcing the adjacency constraints along the horizontal boundaries by its 
next-move relation. Each time a new tile is decided, M, verifies whether it 
has been placed in this position during the first stage of the computation. 
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To this end M, enters the query state qg,. If M, goes into qy then M, 
proceeds to guess the next domino, else, if M, enters qy, then M, rejects. 
M, accepts after the whole tiling has been guessed for the second time. J 


3.7. PROPOSITION. Theorem 3.1 holds true for the alternating-remember- 
ing TMs if they are restricted to be able to enter the query state q, at most 
once in the course of a computation. 


Proof. Again it is enough to reprove Lemma 3.5. We use the same nota- 
tions as those in the proofs of Lemma 3.5 and Proposition 3.6. The way the 
AR-TM M, simulates the domino-tiling game determined by D, and r, is 
similar to the first stage of computation of the M, from the previous proof. 
The tiling is guessed row by row, and the next-move relation of M, 
guarantees the proper adjacencies only along the vertical boundaries of the 
tiles. To enforce selections of tiles matching also the colours below, each of 
the players has the capability to control the moves of his adversary. If a 
player tries to “cheat” and places a tile with improper relationship to the 
tile below then the other player can win the M,-game by proving that the 
move of the adversary was illegal. Technically this is done as follows. Sup- 
pose that one of the players has just selected a new domino tile and placed 
it on the tape fy. The adversary can either challenge this selection or “skip” 
a verification and select a consecutive tile adjacent to the right. If he 
chooses to verify then he does the following: he selects a domino tile not 
matching below, writes it on tj, decrements the row counter by 1, and cen- 
ters q;. From this state M enters either qy or qy. If the former is the case 
then the player who has challenged wins the game, in the other case he 
loses it. Observe that if M enters q y then this is a proof that the player who 
selected the last domino above tried to cheat and so he ought to lose. If, 
on the other hand, M enters qy, then it means that the player who decided 
to check has failed to provide such a proof. In this case the challenging 
player can be regarded as a loser since he did not have to verify selections 
of his adversary. M, at this moment is either in qy or in qy and does not 
know who is the winner, but this can be found out by calculating whose 
turn it was to select a new tile, and hence who decided to verify the last 
selection. To carry out the calculations answering this question M, uses the 
values of the counters on ¢, and t,. Knowing the answer and the fact of 
whether an R-loop has been detected or not, M, accepts of rejects accor- 


dingly. Jj 


4. CONCLUSIONS 


The main result of this paper, Theorem 3.1, is a generalization of a 
theorem of Robson (1985) relating non-looping ATMs to ordinary ATMs. 
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It concerns deterministic and non-deterministic TMs. To carry out the 
generalization an apparently more flexible model of a TM capable 
of resorting to the history of computation by way of an oracle was 
introduced. The viability of this model is demonstrated in the proofs of 
theorems in Section 3. 

There is a natural question whether an analogue of Theorem 3.1 holds 
true for the deterministic and non-deterministic non-looping TMs (defined 
along the lines of Robson (1985)). The answer in the case of DTMs is 
negative since the ordinary and non-looping variants are clearly equivalent. 
It turns out that Theorem 3.1 holds true for non-looping NTMs. The most 
difficult part is Lemma 3.5. To prove it in a similar way it is enough to 
have a mechanism by which a non-looping NTM can verify that a 
particular tile type was placed at a particular grid position. To this end it 
is sufficient to verify that every other tile type was not placed there. This 
verification can be carried out by a non-looping NTM. 

R-TMs also can be interpreted as computing devices having an auxiliary 
space to store their history such that both the amount of it and the time 
to scan it are “for free.” A similar idea of studying computations capable 
of resorting to “hidden” resources was previously explored by 
Mager (1969) and Cook (1971). They investigated the auxiliary pushdown 
automata (APDAs, in short), that is TMs which have an auxiliary memory 
organized as a pushdown store. The space complexity of such devices is 
defined by the amount of the general-purpose storage used, while the stack 
space is not counted. Cook (1971) proved that deterministic and non-deter- 
ministic APDAs with the same space bounds are equivalent, and moreover 
that U.. 9 DTIME(c"”’) is exactly the class of languages recognized by 
APDAs using space s(n). It follows from Theorem 3.1 that both the 
APDAs and DR-TMs with s(n)-space bound recognize the same class of 
languages. 
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